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Abstract 

We describe a representation of the Bolthausen-Sznitman coalescent in terms of the 
cutting of random recursive trees. Using this representation, we prove results concerning 
the final collision of the coalescent restricted to [n] : we show that the distribution of 
the number of blocks involved in the final collision converges as n — > oo, and obtain 
a scaling law for the sizes of these blocks. We also consider the discrete-time Markov 
chain giving the number of blocks after each collision of the coalescent restricted to [n] ; 
we show that the transition probabilities of the time-reversal of this Markov chain have 
limits as n — ► oo. These results can be interpreted as describing a "post-gelation" phase 
of the Bolthausen-Sznitman coalescent, in which a giant cluster containing almost all 
of the mass has already formed and the remaining small blocks are being absorbed. 



1 Introduction 

The Bolthausen-Sznitman coalescent, (U(t),t > 0), is a Markov process which takes its 
values in the set of partitions of N. It is most easily defined via its restriction (ll^ (t),t > 0) 
to the set [n] := {1, 2, ... , n}, for n > 1. Denote by #T^ n \i) the number of blocks of n^t). 
Then (nM(t),* > 0) is a continuous-time Markov chain whose transition rates are as follows: 
if $Tl[™l(i) = b, then any k of the blocks present coalesce at rate 

A t , fc = ( *- 2) y )! , 2 <,< 6 <„. (i.i, 

(6-1)! 

It is usual to start the coalescent from the partition into singletons, 

n(0) = ({l},{2},{3},...), 
and in this case we say that the coalescent is standard. 

The Bolthausen-Sznitman coalescent was first introduced in [3], in the context of the 
Sherrington-Kirkpatrick model for spin glasses. In |21j . Pitman demonstrated a great num- 
ber of its properties. He introduced the class of coalescents with multiple collisions (also 
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known as A-coalescents), gave a construction of them based on Poisson random measures 
and studied the Bolthausen-Sznitman coalescent as a member of this class. Bertoin and 
Le Gall [I] give an alternative derivation in terms of the genealogy of a continuous-state 
branching process. Marchal |15j gives a construction via regenerative sets. 

In this paper, we describe a new representation for the Bolthausen-Sznitman coalescent in 
terms of the cutting of random recursive trees. 

We then use this representation to prove results about the last collision of the coalescent 
restricted to [n], as n — > oo. We obtain scaling laws for the sizes of the blocks involved in 
the final collision; essentially, this collision involves one large block and one or more smaller 
blocks, whose combined size behaves like n , where U has the uniform distribution on [0, 1]. 
We also show that the distribution of the number of blocks involved in the final collision 
converges as n — * oo (for example, the probability that exactly two blocks are involved 
converges to log 2) . 

More generally, we can also consider the discrete-time Markov chain giving the number of 
blocks after each collision of the coalescent restricted to [n]. We show that the transition 
probabilities of the time-reversal of this Markov chain have limits as n — > oo, which we 
make explicit. (We observe in passing that the form of these limiting probabilities yields 
certain infinite product expansions of powers of e, which appear to be new). 

These results can be interpreted as describing a "post-gelation" phase of the Bolthausen- 
Sznitman coalescent, in which a giant cluster containing almost all of the mass has already 
formed and the very small left-over blocks are being absorbed. We also note that the tree 
representation has an intrinsic asymmetry which contrasts strongly with the exchangeability 
properties of the coalescent itself (for example, the root of the tree always represents the 
block containing 1). This makes it possible to read off properties concerning a tagged 
particle in the coalescent process directly from the tree representation. 



2 Random recursive trees 
2.1 The representation 

A tree on n vertices labelled 1, 2, . . . , n is called a recursive tree if the vertex labelled 1 is 
the root and, for all 2 < k < n, the sequence of vertex labels in the path from the root to 
k is increasing (Stanley |26| calls this an unoriented increasing tree). See Figure ^ for an 
example of a recursive tree. Call a random recursive tree a tree chosen uniformly at random 
from the (n — 1)! possible recursive trees on n vertices. A random recursive tree can also be 
constructed as follows. The vertex 1 is distinguished as the root. We imagine the vertices 
arriving one by one. For k > 2, vertex k attaches itself to a vertex chosen uniformly at 
random from 1,2, . . . ,k — 1. For a detailed survey of results on recursive trees, see Smythe 
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and Mahmoud l24l. 




Figure 1: A recursive tree on [10]. 



We can represent an infinite tree by the infinite sequence of integers {pi}i>2, where pi is the 
parent of node i. (The condition for this to be a recursive tree is 1 < p, < i for i > 2.) 

For the purposes of this article, it will be convenient also to define a random recursive tree 
on a label set {l±, I2, ■ ■ ■ , h} where • • • ,h are the blocks of a partition of [n] for some n, 
listed in increasing order of least elements. (That is, we require that l\ < I2 < ■ ■ ■ < h where 
"<" is the total order induced by the least elements of the blocks.) The tree is constructed 
in the obvious way: l\ labels the root and Ik is attached to a vertex chosen uniformly at 
random from those labelled l\, I2, ■ ■ ■ , lk-i- Call the weight of a label the number of integers 
it contains and let C n be the set of partitions of [n] . 

Meir and Moon define a cutting procedure which they apply to random recursive trees. 
They take a random recursive tree on [n] and pick an edge e uniformly at random from the 
n — \ present. This edge is deleted along with the entire subtree below it. These operations 
are then repeated until the root is isolated. The idea of cutting combinatorial trees in this 
way appears to have been introduced in Meir and Moon ^H] and remains a current research 
topic. Interest has tended to focus on the number of cuts required to isolate the root in 
different types of trees. Recent references include Janson |llM12j . Fill, Kapur and Panholzer 
[0] and Panholzer |18| I19j. In particular, 19 a treats the case of random recursive trees; the 
author's presentation of this work at the Mathlnfo 2004 conference in Vienna stimulated 
the present work. A variant of the cutting procedure will be the basis of our representation 
of the Bolthausen-Sznitman coalescent. Suppose that instead of throwing the subtree below 
e away, we add its labels to those of the vertex above e. We repeat this procedure until 
only the root remains, labelled by [n]. See Figured for an example. 

Proposition 2.1. Suppose T is a random recursive tree on L = {l\,l2, ■ ■ ■ ,h} £ £n- Pick 
an edge at random, cut it and add the labels below the cut to the label above. Then the 
resulting tree is a random recursive tree on the new label-set. 
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Figure 2: The cutting procedure for a random recursive tree on [10], with the cuts indicated. 



Proof. (Essentially due to Meir and Moon 11 .) The resulting tree is clearly recursive 
because its labels still increase along all paths away from the root. So we need to show 
that it is chosen uniformly from the set of all recursive trees with the same label-set. Put 
another way, we will show that each of the (6 — 1)!(6 — 1) recursive trees on the label-set L 
with a single marked edge corresponds to a tree constructed as follows: for some 2 < k < 6, 
pick k of the labels, say 1^,1^, ... , li k (taken to be in increasing order), make a recursive 
tree on L \ {Zj 2 , . . . ,k k }, make another recursive tree on {Zj 2 , . . . ,li k } and then join them 
together with an edge between the vertices labelled and l{ 2 . 

There are ( \ ) ways of picking the k labels l^, . . . ,h k . There are (k — 2)! ways of arranging 
the k — 1 largest into a recursive tree rooted at l{ 2 . There are (b — k)\ ways of arranging 
the b — k + 1 other labels into a recursive tree. Clearly each of the trees constructed in this 
way is distinct and also a recursive tree. The number which can be constructed is 

k=2 V 7 k=2 V ' 

Hence, the claimed correspondence holds. □ 

Proposition 2.2. Start with a random recursive tree on [n] and associate an independent 
exponential random variable with mean 1 to each edge. This exponential time is the time at 
which the edge is deleted, at which point the labels in the subtree below it are instantaneously 
added to the label of the vertex above the edge. Then the set of labels forms a partition of [n] 
which evolves according to the dynamics of the Bolthausen-Sznitman coalescent restricted 
to [n]. 



Proof. We need to show that the rate of coalescence of any set of k of the labels is 
whenever there are b vertices in the tree, for A&& defined as at The total rate of 

events when there are b vertices is 6—1. The probability that the next event will coalesce a 
particular fc-set is worked out in the same way as in the proof of Proposition 12.11 Suppose 
we start with label-set L = {h,h, ■ ■ ■ ,h} an d we want the probability that the next event 
is the coalescence of {l^, . . . ,h k }- There are (k — 2)! ways of making a recursive tree on 
{li 2 , . . . ,h k }', there are (b — k)\ ways of making a recursive tree on the remaining labels. 
There are (b — 1)1 (b — 1) recursive trees on a label-set of size b with a single marked edge 
and so the probability that we coalesce {l^, . . . ,li k } is 

(k - 2)\{b-k)\ 
(6-1)1(6-1) ■ 

Hence, the rate at which we coalesce any A;-set is 

(fc-2)!(6-fc)! 

The evolution is Markovian because, by Proposition ^. 11 the resulting tree is another random 
recursive tree, this time on 6 — k + 1 labels. □ 



5 



Because of the recursive way in which the original tree is built, the representations are 
consistent for different n: if n[ n l(t) is the partition given by the tree representation on [n] 
at time t then for all t > and all n E N, 

U^(t)\ [n] =U^(t). 

Thus we are able to define (H(t),t > 0), the Bolthausen-Sznitman coalescent on the whole of 
N, by means of the cutting procedure applied to an infinite random recursive tree (indexed 
by N). 

2.2 Random recursive trees, the Chinese restaurant process and random 
permutations 

There is a well-known correspondence between random recursive trees and uniform random 
permutations. We find it most convenient to demonstrate this connection via the Chinese 
restaurant process of Dubins and Pitman (see Chapter 3 of Pitman which we now 

introduce. 

For < a < 1 and 9 > —a, the Chinese restaurant process with parameters a and 9 
(referred to hereafter as the (a, #)-Chinese restaurant process) is constructed as follows. 
Person 1 enters a Chinese restaurant and sits at the first table. Person 2 sits either at 
the same table or at a new one; in general, each subsequent person (numbered successively 
3, 4, . . . , n) sits either at one of the occupied tables or at a new one. Suppose that people 
1,2, ... ,rn have sat at k tables where table i has nii customers (and Y2i=i m « = m )- Then 
person m + 1 starts a new table with probability 

9 + ak 
m + 9 

and sits at table i with probability 

rrti — a 
to + 9 ' 

for 1 < i < k. Once all n customers have arrived, the tables constitute the blocks of a 
partition of [re] which are consistent for different n. Letting n — > oo, these blocks have 
asymptotic frequencies, where the asymptotic frequency of a block B C N is defined to be 

The existence of this limit for blocks generated by the Chinese restaurant process is guar- 
anteed by Kingman's theory of exchangeable random partitions |13l I14j . Moreover, if 
(Fi,Fz, • • •) is the vector of asymptotic frequencies for a (a, #)-Chinese restaurant process 
in the order of the tables then (F\,F2, ■ ■ .) has the Griffiths-Engen-McCloskey GEM(a,#) 
distribution. If this vector is put into decreasing order of size then it has the Poisson- 
Dirichlet PD(a,#) distribution. (See Pitman |22j or Arratia, Barbour and Tavare for 
more details.) 
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For a = and 9 > 0, there is a simpler construction. Person 1 sits at table 1. For m > 1, 
person m + 1 starts a new table with probability 8/(6 + m) and sits to the left of each of the 
m existing customers with probability 1/(6 + m). In this case, the blocks created contain 
more information than is needed to just give a partition of [n]. In fact, they are the cycles 
of a permutation of [n], written in standard cycle notation (i.e. the permutation maps i 
to the customer seated to the left of i). The special case 9 = 1 gives a uniform random 
permutation of [n] (that is, a permutation chosen uniformly at random from all the possible 
permutations of [n]). Moreover, the permutations are consistent for different n and, when 
n — > oo, we obtain a uniform random permutation of N. 

We can find a (0, 1)-Chinese restaurant process in the construction of the random recursive 
tree on [n + 1]. For this purpose, it is easier to imagine the random recursive tree labelled 
by the set {0, 1, ... , n} rather than {1, 2, . . . , n + 1}. The root (labelled 0) is fixed and does 
not appear in the permutation. Vertices attached to the root correspond to individuals who 
start a new table. Thus, individual 1 necessarily starts a new table. If vertex k arrives 
and attaches to a vertex other than the root (say j) then individual k sits directly to the 
left of individual j. As vertex k is equally likely to attach to each of the vertices labelled 
0,1,..., fe — 1, individual k is equally likely to sit to the left of any of the individuals 
1, 2, . . . , k — 1 or to form a new table. 

Thus, a random recursive tree on [n + 1] corresponds to a random permutation of [n]. 

Just to give some interpretation to the cutting procedure in the Chinese restaurant, we 
embellish the usual model as follows. Each customer present in the restaurant has friends 
arrive at rate 1 and there is also a rate 1 stream of customers who know no-one in the 
restaurant at their time of arrival. Customers who arrive and have a friend present always 
sit to the left of that friend. Friendless customers sit at new tables. Stop the process 
when there have been n arrivals. A meal in the restaurant costs one euro. At rate 1, 
each customer decides to leave and go home. Whenever he leaves, any of his friends who 
arrived after him want to leave (and their friends, and so on). (If the person who decided to 
depart was k then it is the subtree rooted at k in the random recursive tree which departs.) 
Anyone leaving gives the money for their meal to the person to whose left k sat down on 
entering the restaurant, so that he can pay for them when he leaves. If k was, in fact, the 
first person to have sat at that table then he collects all the money for the whole table 
(plus, of course, the price of his own meal), takes it to the cashier and departs. Note that 
at any time t, the amount of money in the cash register is the same as the weight of the 
label at the root in the random recursive tree (i.e. the size of the block containing 1 in 
the Bolthausen-Sznitman coalescent). In this paper, we will answer questions such as: how 
much does the last customer to leave pay the cashier? 
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2.3 Using the construction 



The construction of Subsection 12.11 gives an oddly asymmetrical way of looking at the 
Bolthausen-Sznitman coalescent. Rather than the usual exchangeable partitions approach 
here, because the blocks are automatically ordered according to their least elements, we 
have a size-biased view (smaller numbers are more likely to be the smallest number in their 
blocks). Moreover, a particular realisation of a random recursive tree corresponds to a 
particular conditioning of the coalescent (e.g. if 2 and 5 are both children of 1 then we are 
conditioning 2 and 5 only to be in the same block when they have both coalesced with 1). 
Some facts are very easy to read off from the tree and we will describe here some examples. 

In Subsection 2.4 of Pitman discusses the size of the block containing 1 at time t. 
In terms of the random recursive tree, in order to find the size of the block containing 
1 at time t, it suffices to know only the sizes of the descendencies of all the children of 
the root, plus the times at which the edges from the root to those children are severed. 
By the connection with the Chinese restaurant process, it is clear that in the limit as 
n — > oo the vector of asymptotic frequencies of the descendencies of the children of the 
root has the GEM(0, 1) distribution (i.e. the PD(0, 1) distribution with the blocks in size- 
biased rather than decreasing order). Moreover, by construction, the times at which these 
descendencies are added to the root are independent and identically distributed standard 
exponential random variables. Thus, we have here given a very straightforward proof of 
Pitman's Corollary 16, reproduced below: 

Theorem 2.3 (Pitman). Let fi(t) be the frequency of the block containing 1 at time t in 
a standard Bolthausen-Sznitman coalescent. Then 

(i) the process (fi(t),t > 0) is Markovian, with the same distribution as the process (7(1 — 

e ~ t )/'~f(l)it > 0) where (j(s),s > 0) is a gamma process, with stationary independent 
increments and F(j(s) £ dx) = T(s)~ 1 x s ~ 1 e~ x dx,x > 0. 

(ii) The distribution of f\{t) is Beta(l — e~*,e - *), and the process (— log(l — fi(t)),t > 0) 

has non- stationary independent increments. 

(iii) Let J% > J2 > ■ ■ ■ be the ranked magnitudes of jumps of the process (fi(t),t > 0), and 
let Ti be the time when the jump of magnitude J{ occurs. Then the distribution of the 
sequence (J±, J2, ■ ■ ■) is PD(0, 1), and this sequence is independent of the Ti, which are 
independent with standard exponential distribution. 

((i), (ii) and (iii) are equivalent by properties of the Dirichlet random measure; (iii) is 
immediate from the random recursive tree approach.) 

In Bolthausen and Sznitman [3] and Pitman , it is shown that the marginal distribution 
of the asymptotic frequencies of the blocks of the coalescent (ranked in decreasing order), 
at fixed time t > 0, is PD(e~*, 0). Jason Schweinsberg has observed that the recursive tree 
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provides a simple way of proving this fact. We work via the Chinese restaurant process. 
Let Ei, Ei,-.. be independent and identically distributed standard exponential random 
variables. Fix a time t and imagine constructing the random recursive tree in such a way 
that the vertex i arrives with the edge above it marked if E{ < t and unmarked otherwise, 
for alii > 2. (These marks are the cuts, but we will find it convenient here not to collapse 
the tree at the cuts but rather just to keep track of where they are.) Then vertex i has the 
smallest label in its block if there are no marks in the path from the root to i. Otherwise, 
i is in the same block as the closest vertex to i in the path from the root to i which has 
no cuts above it. Suppose now that vertices 1,2, ... ,n have arrived (possibly with marks) 
in the construction of the random recursive tree. Suppose also that they form k blocks 
(after cutting) of sizes n\, ... ,rif. (where Yli=i n i = n )- Then vertex n + 1 has n possible 
places to attach. It creates a new block if (i) it attaches to the smallest element of one 
of the k blocks and (ii) it doesn't have a mark. This happens with probability ke~ l jn. It 
adds to the ith block (of size n^) if it attaches to one of the Uj — 1 non-smallest elements of 
the block or if it attaches to the smallest element and arrives with a mark. This happens 
with probability (ra, — e - *) jn. But these are exactly the probabilities in the (e~ * , 0)-Chinese 
restaurant process and so we can conclude that the full asymptotic frequencies in decreasing 
order must have PD(e~*,0) distribution. 



2.4 Translating results for trees into results for the coalescent 

As mentioned above, Panholzer jTH] has studied the number of cuts necessary to isolate 
the root in a random recursive tree on [n]. In the context of the Bolthausen-Sznitman 
coalescent, this corresponds to the number of collision events that take place until there is 
just a single block. Let J n be this number of collisions. 

Theorem 2.4 (Panholzer). J n has moments and centred moments as follows: for k G N, 
as n — > oo, 

L J log^n V i^i J log n \log + nj 

and 

- ((-)' +E (* 7 x ) ^-'^ + «) - ° (i^) 

where H n = £" =1 \ and = £ logics). Thus, 

log"- r v , 
n 

as n — > cxd. 

Panholzer notes that it is not possible to obtain a limiting distribution of a centred, scaled 
version of J n by the method of moments. 



E 



|J„-E[J n ] 
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3 The last collision of the Bolthausen-Sznitman coalescent 



3.1 Main results 



Let M n be the sum of the sizes (or masses) of the blocks not containing the integer 1 in 
the last collision of the Bolthausen-Sznitman coalescent restricted to [n], and let B n be the 
number of blocks involved in the final collision. For example, in Figure 121 we have re = 10, 
M n = 5 and B n = 3. 

Theorem 3.1. As n — > oo, 

where (Y(t))t>o is a standard Yule process, U is uniform on [0, 1], E is standard exponential 
and (Y(t)) t >o, U and E are independent. 



The proof of this theorem is quite long and so we defer it to Section 13,41 Theorem 13.11 tells 
us that the final collision of the coalescent restricted to [re] is between a very large block 
containing almost all of the mass and a collection of very small blocks whose total mass is 
roue hly n u . We can make the distribution of the number of the small blocks in the last 
collision explicit as follows. 

Proposition 3.2. For m > 1, 

1 m / \ 

P (Y(UE) =m) = -EU lQ g( fc + X )- 

m k=i ^ ' 

This distribution has infinite mean. 



Proof. Let q m (t) = W(Y(t) = m) for m > 1. It is easily shown (for example, from the 
forward equations) that 

q m (t) = (1 - e-')" 1 - 1 - (1 - e-T 
and so, by Proposition 13.151 fsee Section l3~5|) . we have 



q m {t) 

Thus, 



e * if m = 1 

ET=2 (r') {~l) k (e-^ - e- w ) if m > 2. 



>{Y(UE) = m)=E[q m (UE)] 

JE [e~ UE ] if m = 1 

= lEr=2 (rl) (-l) fe (IE [e-^ 1 ^] -E [ e -*>*\) if m > 2. 
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For any 6 > 0, 

E 







■ e -euE 






-fj 




Jo Jo 



1 

e -0xu e -x dxdu = + -g 



and so 

¥(Y(UE) = 1) =log2 

and, for m > 2, 



=^E(:)(-D t+i >^ +1 ). 

i 1 V / 



k=l 

Finally, as E [Y(t)\ = e*, 



E [K(C^£0] = / / e vx e~ x dxdu = / = oo 

7o Jo Jo l-« 



□ 



For example, putting m = 1 gives P(.B n = 2) — > log 2 as n — > oo. The joint convergence 
given in Theorem IM. II makes it possible to condition on the value of Y(UE) to obtain results 
such as 

Corollary 3.3. Conditional on the event {B n = 2}, 

logM n d 



log n 

where V has density rpp^) i og 2 on -*■] ■ 



V, 



Remarks, (a) We have already mentioned the connection between random recursive trees 
and uniform random permutations. In a permutation, M n represents the length of a 
uniformly-chosen cycle. In fact, considerably stronger results hold on the distribution of the 
cycle lengths. Let K n (t) be the number of cycles of length less than or equal to rv. Then 
the functional central limit theorem of DeLaurentis and Pittel jH] states that 

K„ (t) — t log n \ . 

, < t < 1 (3.2) 



n 



(a random element of D[0, 1]) converges weakly to Brownian motion on [0, 1]. (This theo- 
rem was extended by Hansen to the case of a permutation with distribution given by 
the Ewens sampling formula i.e. a permutation generated by the (0, #)-Chinese restaurant 
process. An alternative proof of her more general theorem, which is more in the style of 
this paper, was given by Donnelly, Kurtz and Tavare [7]. See also Feng and Hoppe |H] for 
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a path-level large deviations principle for the Ewens sampling formula.) 
(b) Take any rooted tree T, random or deterministic. Suppose that each edge e has a 
random variable A e attached to it, where the values A e are independent and identically 
distributed with a continuous distribution. Say that A e is a record if it is the largest value 
in the path from the root to e. Janson shows that the distribution of the number of 
records in T is the same as that of the number of random cuts required to isolate the root. 
To see this, each time cut the edge with the largest A e amongst those remaining. Then by 
symmetry, we always cut a uniformly-chosen edge, and so we get the cutting procedure. 
Moreover, A e is a record if and only if its edge is cut; thus, there must be the same number 
of records as of cut edges. In the records setting, B n has the distribution of the size of the 
maximal subtree of a random recursive tree on [n] containing the smallest record A* and 
only edges e with A e < A*. 

Proposition 3.4. If A n is the time to absorption for the Bolthausen-Sznitman coalescent 
on [n], so that 

A n = mi{t>0:m [n] (t) = l}, 

then 

A n - log log n -> - log E, 
where E has a standard exponential distribution. 

Proof. This is a corollary of Lemma 13.81 in Section 13.41 (to connect the notation, we have 
A n = £*). □ 

This proposition is straightforward, but it nicely illustrates the idea that the Bolthausen- 
Sznitman coalescent on N "only just fails" to come down from infinity (see Pitman [2j| and 
Schweinsberg [2*3*]). 



3.2 The coalescent in reversed time 



We can regard Theorem 13.11 as a statement about the first event in a time-reversed ver- 
sion of the Bolthausen-Sznitman coalescent. More precisely, let X = (Xq,Xi, . . .) be the 
discrete-time Markov chain describing the evolution of the number of blocks in the coales- 
cent restricted to [re], so that X$ is the number of blocks after i collisions, for i > 0. (We 
can do this because the Bolthausen-Sznitman coalescent is homogeneous, in the sense that 
the sizes of the blocks do not influence the dynamics of the process.) X has transition 
probabilities 

**■*-*■!= ( b -i)k(k-iy 2 -"- 6 ' (3 ' 3) 

and is clearly decreasing. Started from Xq = n, for any finite re, the chain hits 1 almost 
surely. Let X be the time-reversal of X, where X starts from 1. We show that the reversed- 
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time transition probabilities 

Plm '■= li m IP (X enters I from m\Xo = n and X hits I) 

lim P [X jumps from I to m|Xo = 1 an d X hits I and n 



n— >oo 

exist and can be made explicit 

Theorem 3.5. We have 
m— 1 



L TE( m z 1 (-l) fc+1 log(fc + l) z/m>Z = l 



TO— 1 / -i 

m — 1 



m 3=i 



(Z-l)(m-0("»-l + l) 

fe=i 



E( ^. J(-l) i+1 Mi + l) (3.4) 

i/m > i > 2. 



E(V) (-i) fc+i io g (^+i) 



The I = 1 case of this theorem is a direct consequence of Theorem 13. II and Proposition 13.21 
In principle, one could extend the probabilistic proof that we give of this case to study 
individual cases of higher I; however, we have not been able to find such a proof for the 
general case, and so the self-contained proof given in Section T3. 51 is via the analysis of certain 
recurrence formulae. 

Remark. We note that p\^ i= P2,3 and so the time-reversal of the Bolthausen-Sznitman 
coalescent is not self-similar in the sense of Bertoin j3j ■ See Pitman |21j , Basdevant |2j and 
Marchal ^S] for results on time-reversing the Bolthausen-Sznitman coalescent to obtain a 
fragmentation process. 



3.3 Number theoretic results 

Before proceeding to the proofs of Theorems 13.11 and 13.51 we observe in passing that Theo- 
rem entails various infinite product expansions of powers of e, which seem not to have 
been previously known. 

Theorem 3.6. For integers r > 1, 

oo / n \ V(n-r+l) 

n=r \k=l / 

For example, the r = 1 case put more clearly reads 

'2\ x/1 ( 2 2 \ 1/2 /2 3 -4\ 1/3 ( 2 4 -4 4 \ 1/4 

e 



1/ V 1 - 3 7 Vl-3 3 / \l-3 6 -5 
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and the r = 2 case reads 

2 2 \ 1 ^ 1 /2 3 -4\^ 2 / 2 4 • 4 4 \ ^ 3 /2 5 • 4 10 • 6\ ^ 4 



1 • 3 J V 1 • 3 3 7 \ 1 • 3 6 • 5 7 V 1 • 3 10 • 5 5 



The r = 1 case of this theorem was first proved by Guillera and Sondow (cited in Son- 
dow HHJ) and bears a certain resemblance to Pippenger's product 20 for e. 

Proof. Consider first the r = 1 case. In Theorem l3.5l we find the distribution (pi n +i, n > 1). 
Summing over n, we obtain 

oo ^ n / \ 

E^EU) (-D fe+1 io^+i) = i. 

n=l k=l ^ 



Exponentiating both sides, we have 



l/n 

e =n (n( fc+i ) m( ~ i)fc 



oo I n 



n=l \fc=l / 

which is Guillera and Sondow's product. 

We now proceed by induction on r. Assume the result up to r — 1. We have 

oo 

^ Pr,m — 1 
m=r+l 

and so, from (|3.4jl . 

Z> (r-1) 



m=r+l 



L)(ro-r)(ro-r + l) Efc=i (V) + 1) 



Hence, 



^EfV) (-l) fc+1 MA: + l) 
fc=i ^ ' 

- t (^7-^)™E("7 1 ) ( - 1) " 11o8(j + 1) 

m=r+l x / j=l x 7 

oo 1 ra / \ 

n=r j=l v ' 

oo 1 n / \ 

n=r 7=1 x 7 



5(3 + !)• 
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Shifting the last term to the left-hand side and using the induction hypothesis, we obtain 

n=r j=l ^ ' 



and taking the exponential of both sides gives the result. □ 



3.4 Proof of Theorem ETT1 

It is convenient to think of the formation of the random recursive tree as occurring in 
continuous time, according to a linear birth process with immigration. (This is the approach 
used by Donnelly, Kurtz and Tavare [Zj when dealing with random permutations.) More 
specifically, we imagine that the root is present at time and that it has a special role. 
Children of the root "immigrate" at rate 1 and each immigrant initiates a family which 
evolves as a standard Yule process. Let Ri be the time of the ith immigration and let P(t) 
be the number of immigrations up to time t (this is just a standard Poisson process). Let 
Yi(t) be the size of the the ith family at time Ri + t and let 



inf < 



P(t) 

t > : J^li(t - Ri) = n 

i=l 



the time at which the total population present (including the root) reaches n. (Of course, 
Pit) 

1 + 2Ji=i ~~ Ri) i s itself just a standard Yule process, but in what follows we find it 
helpful to distinguish the root.) Finally, let C n = P(T n ), the number of immigrations up 
to (and including) time T n . Note that P and Y±, Y2, ■ ■ . are independent. 

The theorem concerns what happens when we cut the tree so much that all of the children 
of the root become severed. We will think of the cutting as occurring in continuous time 
(although subsequent to the formation of the tree). To this end, we associate an independent 
standard exponential lifetime to each individual, which gives the time at which the edge 
above it is cut. 

For each i > 1 we define a family of processes, {(Yj thmncd ^(,s), S > 0),n > 0}, in the 
following way. The tree representing the ith Yule process at time s has size Yi(s). From 
this tree, we remove all the edges whose cutting time is less than or equal to u (and all the 
subtrees below such edges). Let y. thmned ( u )( s ) b e the size of the tree that remains. Note 

that for any u, the process (Y^ hmncd ^ u \s) , s > 0) has the distribution of (Y(e~ u s),s > 0) 
where Y is a standard Yule process. 

For 1 < i < C n , let E± be the exponential lifetime associated with the ith child of the root. 
The last child to be cut is severed at time 

E* = max Ei 

Ki<C n 
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and is the C*th child, where 

C* = argmaxE'j. 

i<i<C n 

E* is the time at which the root is isolated, and is therefore the time of the last event in the 
coalescent process. Let R* = , the time at which the C*th child arrived in the formation 
of the tree. 

In this framework, we have that 

M n = Y Ct {T n -R*), 

the total family size (in the unthinned tree) of individual C* (which is the last child of the 
root to be cut). Similarly, 

B n -l = Y^ nned{E *\T n -R*), 

the number of nodes still remaining in the (thinned) family of C* at the moment when it is 
cut. 

Let t~ = logn — (logn) 1 / 2 and i+ = logn + (logn) 1 / 2 . Let 

E~ = max Ei, 

l<i<P(tn) 

C~ = argmax Ei 
i<i<P{t~) 

and 



R * ~ R C* ■ 

For completeness, if P(t~) = (which becomes very unlikely as n grows) then let E~ = 0, 
C~ = 0, and set R~ equal to a uniform random variable on [0,i~]. 

Now define 

Gn = [Tn € (t-,4), = C- and Y^ ncd ^(t- - I£) = y^in„cd(^) (t + _ 

On the "good set" Q n , we have E* = E~ and R* = R~ almost surely. Also, R~ ~ U[0,i~] 
and is independent of E~ and of the Yule processes Y\ for i > 1. We will prove the following 
three lemmas: 

Lemma 3.7. As n — > oo, 

logY c -(t--R-) d i 

R* 

\ogY c -(t+-R~) d 
++ — p- ~^ ' 
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and 

tn R* 

Lemma 3.8. For any n, 

t-e~ E * =EAt~, 
where E is a standard exponential random variable. 
Lemma 3.9. As n — > oo, ¥ (Q n ) — ► 1. 

To show how the lemmas give the theorem, we proceed as follows. On Q n , we have 

logY c -(t--R-) < l og M„. < logY c -(t+-R-) 
logn ~~ logn ~~ logn 

and 

B n -l = Y t ^ E *\r n -R:). (3.6) 
So to show that (log M ra / logn, B n ) — > (£7, 1 + Y(UE)), it is enough to show that 
(\ogY c -(t--R-) ~ ^ Y ^^ - R^)) ^ (U^YiUE)) . (3.7) 



log n log n 



We can write (t n — R if )/t n = U and t„ e E * = E At n , where U ~ U[0, 1] and £ ~ Exp(l), 
and U, E and Y\, Y%, . . . are independent. 

Using Lemma 13.71 and the fact that t~/logn — * 1, we can rewrite the left-hand side of (|3,7[) 
as 



(A n U,B n U,Y^ eA ^\t-U) 



where A n — > 1 and -B n — > 1 as n — » oo. Thus, using Slutsky's Lemma, it is enough to show 
that 

(U, U, Y ^ nnediE *\t-U)^ 4 (U, U, Y(UE)) . (3.8) 

Recall that we can replace the process ^y^ inned (- B * -*(s),s > 0) by (Y(e~ E * s),s > 0), and 
that t~e~ E * = E At~. Then the left-hand side of ()3.8|) becomes 

(U,U,Y(U(EAt-))). 

Since i~ — ► oo, this indeed converges in distribution to (U, U, Y(UE)), as desired. 

It remains to prove the three lemmas. 
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Proof of Lemma \3.1\ For a standard Yule process Y, 

P (Y(t) = m) = (1 - e~ l ) m - 1 - (1 - e~*) m 
and so, for any < e < 1, 

E (1 - 6, 1 + e]\ = (1 - e-^ (1 " £)t J - (1 - e-^ (1+£)t J, 

which converges to 1 as t — > oo. Thus, 

log y(t) d 



i, 



as t — ► oo. Hence, if V n is a sequence of random variables such that — > oo as n — > oo (in 
the sense that P (V^ > x) — > 1 as n — ► oo, for all x), then also 

log y 04) d 



i, 



as n — > oo. 



The first two parts of the lemma follow, since ~ Z7[0, t n ], and so t n — R^ and f+ — R* 
both converge to infinity in distribution. 

The last part is immediate from the facts that R~ ~ U[0, t~\ and that t~/t^ — ► 1. □ 

Proof of Lemma \3.8L Think of P now as a marked Poisson process: points arrive at rate 
1 and the z'th point carries the mark Ei, where the Ei are independent and identically 
distributed with standard exponential distribution. So, for s > 0, points with mark greater 
than or equal to s arrive as a Poisson process of rate e~ s . The quantity E~ is the largest 
mark out of all those points arriving in the interval [0, t~]. So 

P [E~ < s) = P (no mark > s arrives in [0, t~]) = exp(— i~e _s ). 

So for x < t~, we have 



t-e~ E * >x)=F IE- < - log ( — 

Also, E~ > with probability 1, so t~e~ E * < t~ with probability 1. Thus, indeed, t~e~ E * 
has the distribution of E A t~, as required. □ 

Proof of Lemma \'J. M The time, T n , of the (n— l)th birth in a standard Yule process satisfies 

T n = Ei + -E2 H 1 -E n _x, 

2 n — 1 
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where E\, E2, ■ ■ ■ , E n —\ are independent and identically distributed standard exponential 
random variables. Hence, E [T n ] = logra + 0(1) and var (T n ) = 0(1), so Chebyshev's 
inequality gives 

P(r n €(*-*+))-!, (3.9) 



as n — > 00. 



Suppose now that T n £ (t~,t+) but that C* 7^ C~ . This implies the event that, out of 
all the children of the root arriving in [0, t+], the one with the largest exponential lifetime 
arrived in The probability of this event is simply the ratio (i+ — t~)/t£, which 

converges to as n — > 00. Thus, we have 

P (C, + C~,T n € (*",*+)) -» 0, (3.10) 

as n — > 00. 

Finally, we need to show that 

F thinned(^-) (t - _ = ^thinned^" ) {f + _ ^ ^ ^ 



As observed earlier, we can replace the process so we need to 

show that 



Y(e- E *(t- - R-)) = Y(e- E *(t+ - i?~)), 
As before, we can put t~ — R~ = t~U and e~ E * = (E At~)/t~ to rewrite this as 

Y (U(E A t-)) = Y (U{E A 0§^j|)) "» 1- 

Now P (£ > t~) -> and (i+ - R*)/(t~ - R~) -i- 1 as n 00 (by Lemma Ejl , so it will 
be enough to show that 

P (a jump of Y occurs in the interval (UE, (1 + e)UE)) — > 0, 

as e I 0. This is easily seen to be true (for example, by integrating over U and E), since 
E [y(i)] is continuous in t. 

Putting 1)3 .9|) . ()3.1(Jj) and Q3.11[) together, we have that P (£/ n ) — > 1 as n — ► 00, as required. □ 



3.5 Proof of Theorem 1331 

The self-contained proof of Theorem 13.51 which we give here mostly consists of the analysis 
of certain recurrence formulas via generating functions (see Wilf [28] for an excellent intro- 
duction to this method). We will start by proving the I = 1 case and will then see that this 
enables us to prove the rest of the theorem. Let 



x 



M _ p ( X enterg 1 from m |x = n) 
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for m > 2 (we do not need to condition on X entering 1 ever as this occurs almost surely). 
Then, 

x m ^ = Pm.l = 7 Ty) (3-12) 
(m — ly 



and, by the Markov property, for n > m + 1, 

n— m+1 

n v - 

" fc(fc-l) 



n— m+1 n—m+1 (m) 

(m) _ V- (m) _ W V- g n-fc+l ,3 

fc=2 fc=2 



We can re-phrase the Z = 1 case of Theorem 13.51 as 

Theorem 3.10. Let m>2. If (x^ m) ,n > m) satisfies l/Hlgj) and ^HUBj) then 
lim ,M ~E f 7 " r ("l) fc+1 Iog(fc + 1). 

n— >oo TO — 1 z — ' \ K I 

k=l v 7 

The generating function of the sequence (x^ , n > m) we will use is defined as 

oo 

f {m) {t) = Y J ^ m -it n+m ~\ (3.14) 

n=l 

for t G [0,1). 

The key to proving Theorem 13. 1UI is the following result on the convergence of power series. 

Proposition 3.11. Suppose that f(t) = X^nLi fn~t n * s a power series with positive real 
coefficients such that 

|/n-/n-l|<-^T (3-15) 

71—1 

for some constant C £ (0, oo) and all n>2. Suppose that 

(1 - t)f(t) - /* (3.16) 

as t -> 1-. T/ten lim n _»oo f n = f*. 

Proof. We have (1 — t)f(t) = Y^=i(fn — fn~i)t n , where we take /o = for convenience. 
Then by Littlewood's Tauberian theorem (see, for example, Theorem 1.1 of Chapter 2 in 
van de Lune |2*7| ) 

oo 

/* = '^2(fn - fn-l)- 

71=1 

This sum telescopes and so we have lim ri ^ 00 /« = /*, as required. □ 
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In view of Proposition 13,111 we would like to prove a condition like 1)3. 15j) for the sequence 
(xi m \n > m). It turns out to be easier to prove this for a more general situation. Let the 
sequence (x n ,n > m) be defined as follows for fixed m > 2: for some x m G (0, 1], 

n—m+l 

x n = ^2 -j^Xn-k+i, n>m+l, (3.17) 

fc=2 

where the non-negative coefficients (a n ^k)2<k<n satisfy 

a n ,k = n ^ 1 a n+1>k + ^a n+hk+1 , 2<k<n (3.18) 

and 

K = a«,2 + a n ,3 H 1" a n,n, n>2. (3.19) 

Conditions ([3.18ft and (|3.19|) are exactly those satisfied by the rates in a general A-coalescent 
and (|3.17l) is the recurrence that arises for its reversed-time transition probabilities. In 
Pitman's notation of [2Tj, a„ )fc = {k)K,k and b n = Ylk=2 (k) K,k, where \ n)k = J Q x k ~ 2 (l- 
x) n ~ k A(dx) and A is any finite measure. In the Bolthausen-Sznitman case, a n ^ k = fcprrn 
and b n = n — 1. 

Lemma 3.12. For n > 3, b n = 6 n _i + \a n ,2- 



Proof. We have 



b n -l — O-n-1,2 + o-n-1,3 + • • ■ + a n _i jn _i 

= + jjO^s) + (^«n,3 + ^,4) 

"! !~ (^ a n,n-2 + ^On.n-l) + (^n.n-l + O n , n ) by (|3.18j) 

= —n a n,2 + «n,2 + On,3 H T CL n ^ n 

l, 2 
— On — ^°n,2- 

□ 

Lemma 3.13. For n > m + 2, 

. f n— m ~| 

2^n— 1 — T~~ \ ^ ^ n Q / n,k(%n—k X n — fc+l) ^ ^n,n-m+l^m f • (3.20) 

° n U=2 J 



Proof. We have 
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and 



bn—l%n—l — 0"n— l,2*£n— 2 ^n— l,3*£n— 3 ' ' ' Q>n—l,n—m%m 



_| L ( Hl n _|_ n-m+l , 



bn%n Q"n,2%n—1 Q>n,3%n—2 ' ' ' Q"n,n—m-\-l%m 

+ ^-^-an,2 + „ a n,3j ^n-2 H r a n>n _ m H a nin _ m +ij X r 

Kx n - \a n ^x n -x + H ^a ni2 (x„_ 2 - x„_i) + ^a n)3 (x n _ 3 - x n _ 2 ) 
"I - ■ ■ ■ H ^"Q , r) , jn _ m (3; m x m +i) — (i n n — m j r \x m . 



Hence, 



— ^n^n H Q"n,2(%n— 2 2<n— l) "I - ' ' ' H ^"n,n-m(^m ^m+l) ^ "n^-m+l'-m 

and so, by Lemma 13. 121 

Xfi—l X n — —— •[ n On, 2 (-En— 2 ^n- 1) "I - ' ' ' H ^1n,n-m(^m ^ra+l) ^ ^n,n-m+l^m} 
On 

which gives (j3~2T)|) . □ 

Lemma 3.14. For n > m + 1, we /iai>e 

\x n -i-x n \< r- (3.21) 

n — 1 

Proof. We proceed by induction. For n = m + 1, by (|.3.17|) we have 

_ _ A _ flm+i,2 ^ 1 

^ra ^m+1 — ^ra I J- . I 
V °m+l J 

Now suppose that (|3.21j) holds up to n — 1. From l|M.2()|) and this induction hypothesis we 
have 

' n—m 



„ _ „ I _}_ J \ r n— k m I m— 1 „ 

-t-n-1 x n| ^ ^ S ^ n u n,k n _ k T n u n ,n-m+l 



fc=2 



T~ { 7T ( a ",2 + ' ' ' + fln,n-m) + ^ ! ^^an, n-m+l } 
On 



1 ra 

1 m \ - 

n k=2 

m 

< 



n — 1 

Hence result. □ 
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Thus, we know that for the sequences (a4 ,n > m) associated with the Bolthausen- 
Sznitman coalescent, 

\Jrn) _ ( m ), rn 



n-1 n ' — n —l 

for all n > m + 1. 

As a final preparation for the proof of Theorem 13. 1UI we mention a simple identity. 
Proposition 3.15. For m > 2, 

m / _ o\ 

(1 _ e -r )m -i _ (1 _ e -ryn = £ " ' (-l) fc (e~ (fc - 1)r - e" fcr ). (3.22) 

fc=2 ^ ' 

Proof. By induction on m. □ 

We are now ready to prove Theorem I3.1UI 

Proof of Theorem \S.l(A Recall that for m > 2 and t £ [0, 1), 



/(m)(*) — E ^n+jn-i^"" 1 



r M + n+m-2 
n=l 

Then 

oo 

/[»)(*) = £(« + m - 2 )4Tl-i* n+m " 3 - 

re=l 

From (|3.13|) . we have 

n-m+l (m) 



fn - l)x (m) = n V n ~ fc+1 

fc=2 

and so 



n=l 



(m - l)t 



m— 1 



oo n (m) 



( m _i) 2 +2^ n + m ^ Z^fc( fc _l) 

v y n=2 fc=2 V 7 

, m _l oo n k _i 



T + E E(» + - - * - ^SU^*- 1 



m-1 7 n+m - fe fc(Jfc-l) 

n=2 fe=2 v y 

oo n j.k—1 

+EE*U n+m ""'b- (3 - 23) 

n=2 k=2 
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Now, 



00 ik 



and 



00 

Et = -mi-*) 
fe=i 



and so (|3.23j) becomes 



= —[ + (C 1 " *) 1 °e( 1 "*) + <) /m (0 " f(m)(t) io g (i - 0- 

Re-arranging and integrating gives 



1 /•* u m ~ l 



(1 - t)f( m )(t) = 7 / ; — t, ~du 

y ' m-lj log(l - u) 



I y- log(l-t) ^ _ e -ryn-l — (1 — e~ r ) m 



TO — 1 

where we have changed variable with u = 1 — e~ r . By Proposition 13.151 we have 

1 (m - 2\, , , k f~ (e-V*-^ - e~ kr 

k=2 



dr. 



(1 _ f)/(m)(() ^g(™ :2 y 1)tj r 

Letting t — > 1— , the integral on the right-hand side becomes Frullani's integral and so 
Mm (1 - ()/wW . -L. £ (»;|) (_!)* ,„ g 



t- 

fc=2 
m— 1 

1 

TO 



fc=l 

Thus, by Proposition 13.111 we have 

m— 1 



^tEV'^^-I^^ + I). (3-24) 



ft, m = lim 4-) = 1 ^ ( m lN \ (_i)*+i log(fe+l). 

n— nso TO — 1 \ K I 

k=l V 7 



□ 



It remains only to show how the rest of Theorem 13.51 follows from Theorem 13.101 We first 
note that 

P (X enters Z from uiIXq = n) 
F (X enters I from to|Aq = n and A hits <) = - 



P(Jf hits Z|X = n) 
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Let 

y(Z,m) _ p enters I from m\X = n) 

and 

y® =P(X hits l\X = n). 

Then 

..(l,m) = m I 

am 



(m — 1) (m — I + l)(m — /) 
and the Markov property gives us that 

n-m+l 
(l,m) _ n \ " 



- , ,", "" r .\ . / / > III + 1. 



^ k(k-l)' 

Thus, using Q3.12[) and (|3.13j) . yn' m ^ = ( m ^^i)"^-/) x ""^ anc ^ so by Theorem I3.1U1 

m-l , x 

- (m - l+ " )(m -o ? C"* ) ( - 1)t+Ilog(t + ^ 



Likewise, yP = 1 and for n > Z + 1, 



n-i+i (0 

„(0 = n v yw - fc+1 

Un n-1 ^ k(k-l)' 

k=2 K ' 



Thus, yfjp = (I — l) 2 x$ and so 

l-l 



fc=i 

Hence, finally, 

^• m) _ m Ef^i 1 ( V ) log(j + 1) 



PLm = lim 



n^oo y (0 " (/_l)( m _/)( m _/ + l) ^ fc -\(i-l) log(ife+l) ' 

It is clear from our argument that pi m must be non-negative for all m > I > 1. However, 
by Fatou's lemma, we know only that 

oo 
m=Z+l 

We would like to show that (pi m , m > Z + 1) is really a distribution (i.e. that this inequality 
is in fact an equality). The only thing that can go wrong is if mass "escapes to infinity". 
To show that this does not happen, we firstly make clear the probability space on which 
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we are working. For each state re, sample d n independently from the distribution {p n ,ki 1 — 
k < n — 1). Then for any re, the path of X started with Xq = n is n,d n ,dd n , • • •• Thus, 
we have coupled the paths of the Markov chain X started from all different possible states. 
Now for fixed I > 1 consider the events 

E n := {X enters I from re} = {d n = I}, 

forre>/ + l. We have P (E n ) = p n<l = ^ (n _, + * )(n _ ■ As EZi+i {n -i) { n-i + \)(n-i) < 
by the Borel-Cantelli Lemma, P (E n i.o.) = 0. Hence, we must have 

oo 
m=l+l 

Remark. It does not appear that the methods of this section can be extended to a more gen- 
eral A-coalescent, despite the encouraging fact that Lemma 13 . 141 holds for all A-coalescents. 
The generating function methods used rely crucially on special structure of the Bolthausen- 
Sznitman case, in particular the fact that p n:n _k+i decomposes neatly into two simple 
factors, one involving only n and the other involving only k. 
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